Robustness to additive noise of locally-normalized cepstral coefficients in speaker verification

Authors

  • Josué Fredes
  • José Novoa
  • Víctor Poblete
  • Simon King
  • Richard M. Stern
  • Néstor Becerra Yoma
Abstract

In this paper, the performance of a new feature set, Locally Normalized Cepstral Coefficients (LNCC), is evaluated for a speaker verification task with short testing utterances in additive noise. The results presented here show that LNCC outperforms baseline MFCC features when the SNR is lower than 15 dB. The average relative reduction in equal error rate (EER) achieved by LNCC is 33%. The use of LNCC in combination with spectral subtraction provides a reduction in EER averaging 18% when compared to MFCC features, also with spectral subtraction. In addition, sub-band LNCC is proposed to improve the estimation of noise energy and hence the effectiveness of spectral subtraction. With non-stationary noise, sub-band LNCC led to greater reductions in EER relative to MFCC features than LNCC did.
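The abstract reports its results as relative reductions in EER. As a rough illustration of how such figures are typically obtained, the sketch below approximates the EER from lists of target and impostor trial scores and then computes a relative reduction between two systems. The function names and all numeric values are hypothetical and chosen only for illustration; they are not taken from the paper, and the paper's own scoring procedure may differ.

```python
import numpy as np

def equal_error_rate(target_scores, impostor_scores):
    """Approximate the EER by sweeping a threshold over all observed scores.

    Assumes higher scores mean "more likely the claimed speaker".
    """
    target = np.asarray(target_scores, dtype=float)
    impostor = np.asarray(impostor_scores, dtype=float)
    thresholds = np.sort(np.concatenate([target, impostor]))
    # False rejection rate: fraction of target trials scoring below the threshold.
    frr = np.array([(target < t).mean() for t in thresholds])
    # False acceptance rate: fraction of impostor trials at or above the threshold.
    far = np.array([(impostor >= t).mean() for t in thresholds])
    # The EER is taken where the two error rates are closest to each other.
    idx = np.argmin(np.abs(far - frr))
    return (far[idx] + frr[idx]) / 2.0

def relative_reduction(baseline_eer, new_eer):
    """Relative EER reduction in percent, with respect to the baseline."""
    return 100.0 * (baseline_eer - new_eer) / baseline_eer

if __name__ == "__main__":
    # Hypothetical scores, for illustration only.
    eer = equal_error_rate([2.0, 3.0, 4.0], [0.0, 1.0])
    print(f"EER: {eer:.3f}")
    # Hypothetical baseline and improved EER values in percent.
    print(f"Relative reduction: {relative_reduction(10.0, 5.0):.1f}%")
```

Under this definition, a "33% relative reduction" means the new EER is about two thirds of the baseline EER, not that the EER drops by 33 percentage points.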

Similar resources

Cosine distance features for robust speaker verification

We use similarities with people we know already as a means to enhance the speaker verification accuracy. Motivated by this, we use cosine distance similarities with a set of reference speakers, cosine distance features (CDF), to improve the performance of speaker verification systems for clean and additive noise test conditions. We used mel frequency cepstral coefficients, power normalized ceps...

Robust speaker recognition based on high order cumulant

LP-derived cepstral coefficients are sensitive to additive noise in the speech signal. In this paper, an approach to extracting speech features based on the high-order cumulant is proposed to suppress the effect of additive noise in the speech signal. The performance of this approach is evaluated using a text-prompted speaker verification system. Experimental results show that this approach is effective to...

The Use of Locally Normalized Cepstral Coefficients (LNCC) to Improve Speaker Recognition Accuracy in Highly Reverberant Rooms

We describe the ability of LNCC features (Locally Normalized Cepstral Coefficients) to improve speaker recognition accuracy in highly reverberant environments. We used a realistic test environment, in which we changed the number and nature of reflective surfaces in the room, creating four increasingly reverberant times from approximately 1 to 9 seconds. In this room, we re-recorded reverberated...

Speech Emotion Recognition Based on Power Normalized Cepstral Coefficients in Noisy Conditions

Automatic recognition of speech emotional states in noisy conditions has become an important research topic in the emotional speech recognition area, in recent years. This paper considers the recognition of emotional states via speech in real environments. For this task, we employ the power normalized cepstral coefficients (PNCC) in a speech emotion recognition system. We investigate its perfor...

Improving robustness to compressed speech in speaker recognition

The goal of this paper is to analyze the impact of codec-degraded speech on a state-of-the-art speaker recognition system and propose mitigation techniques. Several acoustic features are analyzed, including the standard Mel filterbank cepstral coefficients (MFCC), as well as the noise-robust medium duration modulation cepstrum (MDMC) and power normalized cepstral coefficients (PNCC), to determin...

Publication date: 2015